Performance Evaluation of L1-norm-based Microarray Missing Value Imputation
نویسندگان
چکیده
l1-norm minimization was utilized in the imputation of microarray missing values, which is an important procedure in bioinformatics experiments. Two l1 approaches, based on the framework of local least squares (LLS) and iterative biclusterbased least squares (bicluster-iLLS) respectively, were employed. Imputed datasets of the l1 approaches were compared with those of traditional l2 methods. The imputation error rates showed that the assumption of sparsity is not supported by the microarray datasets. Singular value decompositions in biclusters and in the neighborhoods of target genes were computed to show the structure of a microarray dataset. The coefficients of l1 minimization solutions were also analyzed to reveal possible reasons for the performance of l1 approaches. Keywords-microarray missing value imputation; sparse representation; l1 minimization
منابع مشابه
A Review on Missing Value Imputation Algorithms for Microarray Gene Expression Data
Missing values has been a common problem in gene expression studies and have a significance effect on the interpretation of the final data. Many bioinformatics analysis tools especially for cancer classification and prediction require complete sets of data matrix. Therefore, development of missing value imputation algorithms is required to solve this particular problem. In this paper, we presen...
متن کاملDealing with missing values in large-scale studies: microarray data imputation and beyond
High-throughput biotechnologies, such as gene expression microarrays or mass-spectrometry-based proteomic assays, suffer from frequent missing values due to various experimental reasons. Since the missing data points can hinder downstream analyses, there exists a wide variety of ways in which to deal with missing values in large-scale data sets. Nowadays, it has become routine to estimate (or i...
متن کاملEffects of Missing Value Imputation on Down-stream Analyses in Microarray Data
Amongst the high-throughput technologies, DNA microarray experiments provide enormous quantity of genes and arrays with biological information to disease. The studies of gene expression values in various conditions and various organisms in public health have led to the identification of genes to the comparison between tumor and normal, clinically relevant subtypes of tumor, and prognostic signa...
متن کاملMissing Value Estimation of Epistatic Miniarray Profiling Data by Kernel Pca Regression Ensemble Approach
Missing data imputation is a key issue in learning from incomplete data. Various techniques have been developed with great success on dealing with missing values in data sets with heterogeneous attributes (their independent attributes are of different types) referred to as imputing mixed-attribute data sets. Epistatic miniarray profiling (E-MAP) is a powerful tool for analyzing gene functions a...
متن کاملMissing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012